Visual Lip Reading Dataset in Turkish

نویسندگان

چکیده

The promised dataset was obtained from daily Turkish words and phrases pronounced by various people in videos posted on YouTube. purpose of compiling the to provide a method for detection spoken word recognizing patterns or classifying lip movements with supervised, unsupervised, semi-supervised learning, machine learning algorithms. Most datasets related reading consist recorded camera fixed backgrounds same conditions, but presented here consists images compatible models developed real-life challenges. It contains total 2335 instances taken TV series, movies, vlogs, song clips vary due factors such as way say words, accents, speaking rate, gender, age. Furthermore, different angles, shadows, resolution, brightness that are not created manually. most important feature our is we contribute non-synthetic pool, which does have wide varieties. Machine studies can be carried out many areas, education, security, social life this dataset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Visual Models for Lip Reading

This chapter describes learning techniques that are the basis of a "visual speech recognition" or "lipreading" system 1 • Model-based vision systems currently have the best performance for many visual recognition tasks. For geometrically simple domains, models can sometimes be constructed by hand using CAD-like tools. Such models are difficult and expensive to construct, however, and are inadeq...

متن کامل

Improving visual features for lip-reading

Automatic speech recognition systems that utilise the visual modality of speech often are investigated within a speakerdependent or a multi-speaker paradigm. That is, during training the recogniser will have had prior exposure to example speech from each of the possible test speakers. In a previous paper we highlighted the danger of not using different speakers in the training and test sets, an...

متن کامل

Visual Words for Automatic Lip-Reading

.................................................................................. i ACKNOWLEDGMENT.................................................................... iv ABBREVIATIONS.......................................................................... v CONTENTS................................................................................... viii LIST OF FIGURES...........................

متن کامل

Lip Reading in Profile

There has been a quantum leap in the performance of automated lip reading recently due to the application of neural network sequence models trained on a very large corpus of aligned text and face videos. However, this advance has only been demonstrated for frontal or near frontal faces, and so the question remains: can lips be read in profile to the same standard? The objective of this paper is...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Data

سال: 2023

ISSN: ['2306-5729']

DOI: https://doi.org/10.3390/data8010015